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SUMMARY 

This paper reports on the ongoing efforts to model the human operator in 
the context of the task during the enroute/return phases in the ground based 
control of multiple flights of remotely piloted vshicles (RPV), This is a part 
of our research aimed at investigating human performance models and at modeling 
command and control systems.* 

The approach employed here uses models that have their analytical bases in 
control theory and in statistical estimation and decision theory. In 
particular, it draws heavily on the models and the concepts of the optimal 
control model (OCM) of the human operator. We are in ihe process of extending 
the OCK into a combined monitoring, decision, and control model (DEMON) of the 
human operator by infusing Decision theoretic notions that make it suitable for 
application to problems in which human control actions are infrequent and in 
which monitoring and decision-making are the operator's main activities. Some 
results obtained with a specialized version of DEMON for the RPV control problem 
are included. 

1. INTRODUCTION 


1.1 Modeling Goals 

We are involved in a program of research aimed at investigating human 
oerformance models and approaches to modeling command and control systems (see 
reference 1 ) . A part of our research effort concerns the study of the 
feasibility of modeling the human operators in command and control systems via 
control and decision theoretic models. This paper describes the salient aspects 
of this part of our ongoing research effort. 


1.2 Modeling Approach 

The approach employed here uses models that have their analytical bases in 
control theory and in statistical estimation and decision theory. In 
oarticular, it draws heavily on the models and concepts of the OCM (references 
2-6). The modeling approach is normative, in that one determines what the human 
operator ought to do, given the system objectives and the operator's 
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limitations, and this serves as a prediction of what well- trained, motivated 
operators will do. 

In the basic OCM concern is more with the operator’s continuous interact, j 
with the system, as demanded by closed loop analysis, than with his response to 
discrete events. The development of the basic OCM and its model structure has 
been dictated by the principal areas of its previous application, viz., vehicle 
control. We shall extend the OCM by incorporating structures and notions that 
make it suitable for application to problems in which human control actions are 
infrequent and in which monitoring and decision-making are the operator's main 
activities.* The expected end product is a combined monitoring, decision, and 
control model for the human operator in a command and control task. 


1.-3 Task definition 

In this paper we shall discuss our modeling effort as it relates to the 
task facing the human operator during the enroute/return phases in the ground 
based control of multiple flights of remotely piloted vehicles (RPV). 

The enroute/return phases together with a terminal control phase 
constitute an "RPV mission". An RPV -mission consists of coordinated flights of 
several RPV-triads. Each triad has a strike vehicle (S), an electronics 
countermeasures vehicle (E) and a low- reconnaisance vehicle (L). Each RPV is 
automatically controlled along a pre-programmed flight plan assumed optimal 
with respect to terrain and defenses. The RPVs deviate from their flight plan 
due to navigation system errors, position reporting errors, communication 
jamming by the enemy, equipment malfunctions etc. These deviations are kept in 
check by external monitoring and control from the ground station. This 
supervision is provided by human enroute controllers, who are equipped with CRT 
displays for monitoring flight path and vehicle status and with keyboards and 
light pens for introducing changes in RPV flight parameters. The ultimate 
objective of the enroute controllers is to ensure that the S and E RPVs fly on 
schedule over the target 15 seconds apart followed by the L RPV two minutes 
later to assess damage. This time-phasing at the target is accomplished by 
time-phased handoffs at designated hand-off coordinates on the flight plan. The 
S RPVs are handed off to the terminal controller (pilot) equipped with a 
televised view from the nose of the RPV and with standard aircraft controls and 
displays in order to direct each vehicle to a speoific designated target, 
release its payload, and hand it back to one of the enroute controllers. 

Terminal phase control is achieved only if the S RPV is within a 1500* 
corridor around its flight plan. It is the responsibility of the enroute 
operator to command "patches" to alter the flight plan as necessary to achieve 
terminal phase control. These patches are acceptable ("GO") only if they 
satisfy constraints such as turning radius, available fuel, command link status 
etc. 


* This type of extension is feasible beoause of the basic information processing 
structure of the OCM. Indeed, there have already been applications of OCM to 
account for visual scanning( references 7,8) and decision makinK( references 
9 , 10 ). 


PTii summary, the enroute operator’s task is to monitor the trajectories and 
TAs of N vehicles, to decide if the lateral deviation or ETA error of any of 
ese exceeds some threshold, and to correct the paths of those that deviate 
excessively by issuing acceptable patches. 

2. THE O.OSED LOOP MODEL 

A block diagram modeling the flow of information and the control and 
Figure°”^ encountered by the human operator (enroute operator) is shown in 
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Figure 1. Block Diagram for RPV Monitoring/Control Decision Problem 

thaf DCF (Drone control facility) contains the stored flight plans 

that drive the N subsystems RPVi, i=1,2,...,N. They are usually "optimal" with 
Current terrain and other information. We will assume they can be 
computed using state-variable equations. ^ 

System: The N RPVs undergoing monitoring/ control constitute the system. A 
simple non-linear representation of their dynamic behavior will be assumed for 
analysis. Linearization will be carried out if necessary for 
implementation of the model. The true status x^ of the i-th RPV isav be 

different from the stored flight plans due to "disturbances" w^. The reported 
status yi will be different from the true status xi due to reportlnrerrcrvi 
The observed status ji^p will depend on the reported status y- and on tL 
monitoring strategy" (to be discussed later on). The disturbances wi and 
reporting error v^ will be modeled by suitable random processes. The yi are the 
displayed variables corresponding to RPVi. r . me y are tne 


649 





Monitoring Strategy: Since the human must decide which RPV or which 
display to look at, he needs to develop a monitoring strategy. This is 
important because his estimates of the true status of each RPV (and hence his 
patch decision strategy) will depend upon his monitoring strategy. To account 
for the interaction of the patch decision strategy with the monitoring strategy 
we formulate and solve a combined monitoring and patching decision problem 
(Appendix B has the details) . 

Monitoring strategies may be distinguished by whether they predict 
temporal (time histories of) monitoring behaviour or average monitoring 
behaviour over some chosen time horizon. Most of the earlier work in the 
literature, including that with the OCM, falls in the latter category. The 
monitoring strategy we derive will predict temporal behaviour which can be 
simulated . Some of the monitoring strategies derived in the literature which 
we expect to investigate in the DEMON setup are; 

(1) A simple strategy involving cyclical processing of the various 
RPVs( reference 11). 

(ii) A strategy generalizing the Queueing Theory Sampling Hcdel (reference 
12) , which would minimize the total cost of not looking at a particular RPV 
at a given time. This strategy is mainly useful for maintaining lateral 
deviations within allowable limits. The costs for errors and for the 
different RPVs would be functions of the time-to-go and, possibly, RPV 
type. 

(iii) A strategy of sampling when the probability that the signal exceeds 
some prescribed limit is greater than a subjective probability 
threshold (references 13,14). 

(iv) A strategy aimed at minimizing total estimation error (reference 7). 
This strategy would be consistent with monitoring for the purpose of 
minimizing lateral deviation errors. 

Information Processor: This block models the processing that goes on in 
the human operator to produce the current estimate of the true RPV status from 
past observed status. This block is the well known control- theoretic model 
consisting of a Kalman filter-predictor which produces the maximum-likelihood, 
least-squares estimate x =(x1, x2,. ., x^) of the true status x of all the 
RPVs. It also produces the variance of the error in that estimate. (Note that an 
estimate of the state of each RPV is maintained synchronously at all times. 
Observation of a particular RPV improves the accuracy of the estimate of the 
status of that RPV while uncertainty about the status of the remaining, 
unobserved x-ehicles increases.) Given the assumptions generally made for this 
kind of analysis, the information processor can thus generate the conditional 
density of x based on the past observations y. 

Decision Strategy: This block models the process of deciding which, if 
any, RPV to patch. We consider the decision process to be discrete (it takes 5 
sec to get a new display). The cost of making a patch would reflect the lost 
opportunity to monitor and/or patch other RPVs as well as breaking 
radio-silence; the gain (negative cost) is the presumed reduction in error for 
the "patched" vehicle. The decision strategy attempts to minimize the 
(expected) cost. This block translates the best estimate x ••nto a decision to 

(i) command a patch to one of the RPVs and/or (il) modify the future monitoring 
strategy. ® 


Patch Coonand Generator: This block generates the commanded patch.. We 

shall investigate a strategy based on minimizing a weighted sum of the time to 
return to the desired path and the total mean-square tracking error. The 
allowable paths would be constrained by the RPV turning radius limits. Random 
execution errors would be added to the commanded patch to represent human 
errors . 

Patch Check: This consists of a GO/NO GO check on the patch using 

conditions on turning radius, coiranand link status, etc. 

3. MATHEMATICAL DETAILS OP THE MODEL 

3* 1 System 

The system under study consists of the N-RPV subsystems and may be 
described by the state equations:* 

i s kx + dBu +Ew +Fz ,x(tQ) = xq (1) 

where the state vector x includes the states x^ of the N-RPV subsystems. Here d 
is a vector of decision variables (to be explainsd below) and z is a non-random 
input vector which will be used to model non-zero means of the random inputs w 
as well as any predetermined command inputs. In the present RPV context z will 
be used to generate the flight plan for the RPVs. The vector u denotes the 
patch control input to the RPVs. In partitioned form equation (1) appears as 
follows: 
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For the system under study, the following observations hold: 

Al: Only one of the N-RPV subsystems may be controlled by the 

patch-control u at any given time. A decision to control the i-th RPV subsystem 
then implies the following conditions on the decision variables: 

di s 1 , dj = 0 , J i (3) 

A2: The N-RPV subsystems are decoupled (except for the interdependence of 

the decision variables via (3)), that is. 


* For the purpose of dlscussi'>n, a linear model is assumed. In actual 
implementation, we may use a simple non-linear model in which case (1) would 
represent a linear perturbation equation for the system about some nominal 
trajctory. 
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( 4 ) 


AiJ = 0, fiiJ = 0 , PiJ = 0, 

The N-RPV subsystems may thus be described by 
ii = Aii xi + diBiu + + pi^zi, x^Cto) = 

dj^ = 0 or 1 
J di = 1 or 0 


(5a) 

(5b) 

(5c) 


3.2 Flight Plan (DCF) 

When there is no disturbance and no (patch) control u then the N RPV 
subsystems follow the flight plan x 

x^ = A^^ x^ +F^^z^, x^(tQ) = x^ 

Flight plans made up of straight lines are easily generated using a piecewise 
constant time function for z^ and xj as the launch point. 


3.3 Patching 

Any disturbance wi causes the i-th RPV to deviate 
Denoting these deviations by e^ = xi - xi it follows from (5) and (6) that 

^i . Aii et + diBiu + , et(to) = xj-x{) <7a) 


di = 0 or 1 
I di = 1 or 0 


(7b) 

(7c) 


It is the purpose of the (patch) control u to correct any such if 55S 

wt is an unknown random disturbance and d^ is nonzero ^r at “°®7oeratS thus 
subsvstem. it is not possible to maintain et=0 for all i. The 
faoS lte patching prohlem which conalata of the following three aub-proble«s. 
(1) Monitoring decision - which RPV to monitor? 

(ii) Patching decision - whether to patch the monitored RPV? 

(iii) Patch computation - what patch command to issue? 

3.3.1 Monitoring Decision 

As mentioned before, the monitoring "pol! 

the natchine decision because it restricts the available patching options, tor 
SLpfef l5f the BPV cohtext only a monitored BPV can be patched. The 

combined monitoring and patching decision problem la analyaed In appendix B. 
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3*3«2 Patching Decision 

A patching decision consists of deciding if the monitored RPV subsystem is 
to be patched. At most one of the RPVs may be patched at a given time. One 
idea of patching is to reduce deviations from the flight plan to below some 
threshold values. Some facts to note are: 

(i) Cross- track error of less than 250* is desired for type-S RPVs 
(ii) Terminal-phase control not possible if cross track error exceeds 1500' 
We assume a normative model, in which the operator attempts to optimize some 
(subjective) measure of performance via a patching decision. This performance 
measure would depend on his understanding of the mission objectives. Some of the 
objectives of the RPV mission are: Don't lose an RPV, maintain ETA, maintain 
lateral position, maintain radio silence. We consider two alternative cost 
functions to help in arriving at a patching decision: 

Piecewise constant cost function 

C(e^) = if e a threshold set 

C(ei) = Ci if ei ^ e^ 

Quadratic Cost function 

C(ei) = ei' K ei 

The choice of e^ and K will be made based on facts of the type (i) and (ii) 
noted above. The costs C^, C^, C(e^) will be chosen to be functions of mission 
time to reflect the importance of ETA. As mission time gets closer to ETA for 
RPV-i, will be made larger and/or e| will be shrunk to reflect "urgency". 
The optimal patch decision will be chosen to minimize the expected cost using 
subjective probabilities computed with the help of the information processor. 
The details are in Appendix B. 

3«3«3 Patch Control Computation and Generation 

Once a decision is made tc patch a particular RPV-subsystem, it is 

necessary to compute and execute the patch control. The purpose of a patch 

control is to guide the aircraft from its initial location and heading to 

intercept and fly along the planned flight path. Various criteria may be 

considered to compute the optimal patch control, for example, a strategy that 
minimizes the time to return to the planned flight path (see appendix A and also 
reference 15). 


h. IHPLEHBNTATIOM THE HODS. 

DEMON, the combined monitoring, decision, and control model of the human 
operator is being implemented in FORTRAN. The program ha.*, a modular structure 
to facilitate ease of adding further modules to Include alternative monitoring, 
control, and decision strategies that may appear promising at a future date. 

To accomodate the random aspects of the problem, the program will basically 
have a Monte-Carlo simulation character. The specialized version of DEMON for 
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observed via a single lateral deviation display and 
patches issued previously by the operator for that RPV. 
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7. APPEMDIX 1: PATCH CONTROL STHATECI 
7,1 Systea Oynaaics and Patch Coaputation 

Tn <5«/*Hon ^ the N-RPV system dynamics were considered in general terms. 
Here ielhaU f simple model fo/ the RPV-subsystem dynamics and derive a 

specific patch control strategy. Considering only the i"/?® 

hSriLntal plane we shall re-write the normalized equations of motion derived in 





Figure 4 . Choice of Co-ordinates for System Equation 
reference 15 . using the state y«.i.bles(see figure 4 ) ^ 

X2 s cross-track error, X3 * velocity component along track, X4 - neaaing 
relative to track: 

a cos X4 - 1 , xi(0) given, x^d) free 

i2 s sin X4 , X2(0) given, X2<T) *0 

43 au sin X4 , X3(0) given, X3(T) a1 

44 a-u , X4(0) given, X4(T) aO 

T free 

x| + x^ a 1 

Once a decision is made to patch a particular RPV-subsystem, it is 
necessary to compute and execute the patch control. The purpose 
control is to guide the aircraft from its initial location and hiding to 

intercept and fly along the planned flight path. °**i^**^J® “A 

considered to compute the optimal patch control. Many criteria may 

in the form, A 9 v 

J a l/ 2 Kix^(T) + I/2K2 J x| dt ♦ K3 J dt 

uhleh is a weighted sum of the square of the ground speed error, integral square 
of the cross-track deviation, and time to return to the planned 
shall only solve the special problem of minimis time to return to the flight 
path by choosing the weights to be Kia0aK2 and K3ai. 
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7*2 Miniaua Tiae Patch Strategy 

Using the necesssary conditions for minimum time it is easy to see that the 
optimal control is Bang-Bang except for possible singular arcs. It can furthur 
be shown that the singular control is Identically zero. 



Figure 5. Minimum Time Patch Control Strategy 

The computed minimum-time patching strategy is Indicated in Figure 5. For 
example, all points in state apace that can be brought to the planned flight 
path using a single left tu^'n usi are characterized by the equation xoCO) = cos 
X4(0)-l . 

The minimum time required for the patch will be checked against the 
scheduled hand-off times for the given RPV to detennine if the computed patch 
should be executed. Velocity patches to correct for ETA errors with due regard 
to fuel constraints may be Included by a simple extension of the above problem 
(for example, append to the minimum time patch a velocity patch to minimize ETA 
errors) . 

The operator does not observe the states x directly, and will base his 
control actions instead on the best estimates of these states available to him 
based on all his observations. This disjoining of estimation and control is 
justified by the "separation principle" (see reference 17). 

8. APP8IIDI1 B: PATCH DBCISION STHATBGT 

6, 1 Introduction 

In this appendix we shall formulate and solve the combined monitoring and 
patching deoison problem encountered by the enroute operator in the RPV mission. 
As stated in section 3, the information processor produces the current estimate 
X of the true status- x of all the RPVs at any time. It also produces the 
variance of the error in that estimate. The information available for making 
monitoring and patching decisions may be summarized in terms of the posterior 
distribution of x^ conditioned on all observations based on past monitoring and 
patching decisions and control. Under the usual assumptions, this posterior 
distribution for xi is N(xi, X^i). 
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Let denote a threshold set associated with the i>th RPV, that is, x^c 
x^ is a desirable condition. Let denote the hypothesis that x^ and 
be the probability that is true. P^ is easily calculated using the available 
information on the posterior distribution of x^: 

pi = 1 - H(xl , Xll) dxl 

4 

^nitoring the i-th RPV results in a tighter distribution for x^ around its mean 
xi because it reduces the uncertainty xH associated with x^. Patching the i-th 
RPV requires monitoring as well. The effects of patching are: first, to 

correct the error el tdiioh might have 'wandered off from zero due to 
disturbances, by assuring that xl c x^; and second, to provide a tighter 
distribution of xl around its mean xl. 

To formulate and solve the combined monitoring and patching decision 
problem, we shall assume that is the cost if r 1 is true. Recall that rI has 
a (subjective) probability plof being true. Just as H^, pl, Cj[ were defined in 
relation to the set x^, let Rj, Pl, be defined in relation to the set x^, the 
complement of x^. We shall use minimum expected cost EC(d*) as the criterion 
for selecting the best monitoring and patching decision d*. 

Lot d^j denote a decision to monitor RPV-i and patch RPV-j in the combined 
monitoring and patching decision problem. Since a patch can be done only on a 
monitored RPV, there are only 2N-f1 available decisions. They are: 

(i) Do nothing decision doQi that is, monitor no RPV and patch no RPV.* 

(ii) N pure monitoring (no patching) decisions djQ, jsi,2,..,N. 

(ill) R patching (and monitoring) decisions djj, 3-1«2,...,N. 

Let P^j|( denote the probability that the hypothesis is true when the 
decision is dj|(. Because the RPV subsystems are non- interactive, it follows 
that the probabilities associated with RPV-i when some other RPV is monitored 
and/or patched is same as that associated with RPV-i when no RPV is monitored. 
That is. 




/ 


**i00 * Pijk 1*1, 2,..., N; ksj or 0 

Thus, there are only 3N distinct probabilities to be computed 

(i) N probabilities Pj^oo associated with do-nothing decision dQQ 

(ii) N probabilities Pj^^Q associated with pure monitoring decision d^o 
(iil)N probabilities Pij^i associated with patching decision dj[i 

Let (PP)i denote the probability that the patch decision d^j "takes”, that is, 
results in x^ x^, and let Tjj denote the cost of implementing decision djj. 
The costs Tij will be chosen to be functions of mission time to reflect the 
importance or ETA. As mission time gets closer to ETA for RPV-i, T^j will be 
made larger and/or x^ will be shrunk to refleot "urgency". 


* This could correspond to performing some other task such as communication. 
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The combined monitoring and patching decision p<r*oblem is described in terms 
of a decision- tree diagram in Figure 5.* The actual cost of a particular 
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Figure 6. Decision Tree Diagram for Combined Monitoring and Patching 

decision depends on the path chosen to traverse the tree from level 1 to level 
5. The exact path from level 1 to level 5 for the N-BPVs are determined both by 
the deoislon maker (the human operator) and by Nature (the random elements in 
the problem) . Since a decision has to be made at level 1 before Nature has 


{ 


taken its course at the monitoring level 3 and at the patching level M , the 
decision maker can only evaluate his 2N+1 alternative decisions in ten.c< of 
their expected costs. This he can do as follows. The expected cost of the 
do-nothing decision doo ^9 

EC(dgo) =^1 (Ci Pjoo + Cj Pioo^ 

Expected cost of pure monitoring decision djg is 

BC(djo) s 

Expected coat of a patching decision djj is, 

EC(djo) = ^‘'^‘*OP-"^^j^jOO+^,lPjOO)+(CjPjjj+CjPjjj'-(PP P Tjj) 

The optimal decision d* is the one which results in maximum opportunity galni 
that iS(* 

d* s arg min ( EC(doo. EC(dao), EC(di{i{) ) 

where 

■ . «rg M,] «CjPjoo.CjPjoo)- (CjPjJO^CjPjJo)- Tjo ) 

k . «-g MXJ ((CjPjoo<=jPjoo)*(CjPjjj.i!,iPjjj)-(PPjPjjj;cj.Cj).i^j) ) 

Consider a specialisation of the above decision prc/biem where the 
probabilities Pi«{ are assumed to be independent of the decisions dj^ (that is, 
Pljk * ^i) f coats Cj and Tj« are all zero, and the patch success 
probabilities (PP)i*1 for each subsystem RPV. Then the optimal decision is 

* ‘*JJ 

where 

J = arg maxj (Pj. Cj^) 

This is the result obtained by CarbonelK reference 12). 

An implicit assumption made in the computation of expected cost in the 
combined monitoring and patching decision problem is that the costa are constant 
over the entire se^<s 3?j and xj* This assunption is easily dropped when 
non-constant cost functions are desired, e.g., 

C(ei) s ei’ M ei 

In such a case, Ptj|<Cj in the above analysis will be replaced by an appropriate 
integral which would yield PijjtCi as a function of x^ and and appears 
amenable for computations. 


• Th. oouttpo arg. min. UpUaa d».aoo or or d^ dapandtag oa ahieh of tha 
three values EC(dor ), EC(dg,o), ®C(dj{i() is the smallest. Here dno ie the best 
monltorin? decision and d)(|( is the best patching decision. 
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We close this appendix, ’.rith an example of a piecewise-constant cost 
function that appears meaningful f'^r the N-RPV system under study. Recall from 
appendix A that the first two components of x^ are: 

= ground speed error (along track) 

x^ = cross-track error 

One choice for the piecewise-constant cost function is: 

C(ei) = 1 if |x|l > x^T *250 

s 0 if Ixji i 250 
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